The Geometric Maximum Traveling Salesman Problem 



Alexander Barvinok^ 
Dept. of Mathematics 
University of Michigan 
Ann Arbor, MI 48109-1009, USA 
barvinok@math . lsa . umich . edu 



Sandor P. Fekete* 
Dept. of Mathematical Optimization 
Braunschweig University of Technology 
38106 Braunschweig, GERMANY 
s . f ekete@tu-bs . de 



David S. Johnson 
AT&T Research 
AT&T Labs 
Florham Park, NJ 07932-0971, USA 
ds j ©research . att . com 

Gerhard J. Woeginger^ 
Dept. of Mathematics 

University of Twente, P.O. Box 217 
7500 AE Enschede, THE NETHERLANDS 

g . j . woeginger@math . utwente . nl 



Arie Tamir 
School of Mathematical Sciences 
Tel Aviv University 
Tel Aviv, ISRAEL 
atamir@math . tau .ac.il 

Russ Woodroofe 1 
Dept. of Mathematics 
Cornell University 
Ithaca, NY 14853-4201, USA 
paranoia@math . Cornell . edu 



Abstract 

We consider the traveling salesman problem when the cities are points in R d for some fixed 
d and distances are computed according to geometric distances, determined by some norm. We 
show that for any polyhedral norm, the problem of finding a tour of maximum length can be 
solved in polynomial time. If arithmetic operations are assumed to take unit time, our algorithms 
run in time 0{n^~ 2 logn), where / is the number of facets of the polyhedron determining the 
polyhedral norm. Thus for example we have 0{n 2 \ogn) algorithms for the cases of points in 
the plane under the Rectilinear and Sup norms. This is in contrast to the fact that finding 
a minimum length tour in each case is NP-hard. Our approach can be extended to the more 
general case of quasi-norms with not necessarily symmetric unit ball, where we get a complexity 
of 0(n 2 f- 2 logn). 

For the special case of two-dimensional metrics with / = 4 (which includes the Rectilinear 
and Sup norms), we present a simple algorithm with 0(n) running time. The algorithm does not 
use any indirect addressing, so its running time remains valid even in comparison based models 
in which sorting requires il(nlogn) time. The basic mechanism of the algorithm provides some 
intuition on why polyhedral norms allow fast algorithms. 
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Complementing the results on simplicity for polyhedral norms, we prove that for the case of 
Euclidean distances in M. d for d > 3, the Maximum TSP is NP-hard. This sheds new light on 
the well-studied difficulties of Euclidean distances. 

1 Introduction 

In the Traveling Salesman Problem (TSP), the input consists of a set C of cities together with the 
distances d(c,d) between every pair of distinct cities c, d £ C. The goal is to find an ordering or 
tour of the cities that minimizes (Minimum TSP) or maximizes (Maximum TSP) the total tour 
length. Here the length of a tour c^m , c^) , ■ • • j Cn(n) ^ s 



Like the Minimum TSP, the Maximum TSP is NP-complete on graphs, even if the triangle 
inequality holds. After 25 years, the best known performance guarantee for the metric Minimum 
TSP is still Christofides's 3/2 approximation algorithm; the results of Arora, Lund, Motwani, 
Sudan, and Szegedy [I] show that the problem cannot be approximated arbitrarily well, combined 
with results by Papadimitriou and Yannakakis it follows that this holds even for the special 
class of instances where all distances are 1 or 2. There has been more development on approximation 
algorithms for the metric Maximum TSP: Recently, Hassin and Rubinstein |18j have given a 7/8 
approximation algorithm. 

Of particular interest are geometric instances of the TSP, in which cities correspond to points 
in M. d for some d > 1, and distances are computed according to some geometric norm. Perhaps 
the most popular norms are the Rectilinear, Euclidean, and Sup norms. These are examples of 
what is known as an "L p norm" for p = 1, 2, and oo. In general, the distance between two points 
x = (x 1 ,x 2 , ■■■ ,x d ) and y = (yi,y2, ■■■ ,Vd) under the L p norm, p > 1, is 



This paper considers a second class of norms which also includes the Rectilinear and Sup norms, 
but can only approximate the Euclidean and other L p norms. This is the class of polyhedral norms. 
Each polyhedral norm is determined by a unit ball which is a centrally-symmetric polyhedron P 
with the origin at its center. To determine d(x,y) under such a norm, first translate the space 
so that one of the points, say x, is at the origin. Then determine the unique factor a by which 
one must rescale P (expanding if a > 1, shrinking if a < 1) so that the other point (y) is on the 
boundary of the polyhedron. We then have d(x,y) = a. 

Alternatively, and more usefully for our purposes, we can view a polyhedral norm as follows. If 
P is a polyhedron as described above and has / facets, then / is divisible by 2 and there is a set 



n-l 





with the natural asymptotic interpretation that distance under the norm is 



d(x,y) = maxf =1 |a?i — y» 
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Hp = {hi, . . . , hf/ 2 } of points in W 1 such that P is the intersection of a collection of half-spaces 
determined by Hp: 

7/2 \ ///2 

f|{x : x • hi < 1} n f|{x : x • h, > -1} 
Then we have 



d(x,y) =max{|(x-y)-h l | : 1 < t < fft} 



Note that for the Rectilinear norm in the plane we can take Hp = {(1, 1), (— 1,1)} and for the Sup 
norm in the plane we can take Hp = {(1, 0), (0, 1)}. 

For the Minimum TSP on geometric instances, two key complexity questions have been an- 
swered. As follows from results of Itai, Papadimitriou, and Swarcfiter |19j . the Minimum TSP is 
NP-hard for any fixed dimension d and any L p or polyhedral norm; see also the earlier results by 
Garey, Graham, and Johnson an d Papadimitriou On the other hand, results of Arora [3] 
and Mitchell [2S1 imply that in all these cases a polynomial-time approximation scheme (PTAS) 
exists, i.e., a sequence of polynomial-time algorithms A^, 1 < k < oo, where is guaranteed to 
find a tour whose length is within a ratio of 1 + (1/fe) of optimal. 

The situation for geometric versions of the Maximum TSP has been less clear than for its min- 
imum counterpart. Serdyukov [2Hj, and, independently, Barvinok have shown that once again 
polynomial-time approximation schemes exist for all fixed dimensions d and all L p or polyhedral 
norms (and in a sense for any fixed norm; see [£]). Until now, however, the complexity of the 
optimization problems themselves when d is fixed has remained open: For no fixed dimension d 
and L p or polyhedral norm was the problem of determining the maximum tour length known either 
to be NP-hard or to be polynomial-time solvable. In this paper, we resolve the question for all 
polyhedral norms, showing that, in contrast to the case for the Minimum TSP, the Maximum TSP 
is solvable in polynomial time for any fixed dimension d and any polyhedral norm: 

Theorem 1 Let dimension d be fixed, and let \\ ■ \\ be a fixed polyhedral norm in M. d whose unit 
ball is a centrally symmetric polyhedron P determined by a set of f facets. Then for any set of n 
points in M. d , one can construct a traveling salesman tour of maximum length with respect to \\ ■ \\ 
in time 0(n^~ 2 log n) in the real number RAM model, where arithmetic operations take unit time 
and only addition, subtraction, multiplication, and division are allowed. 

A similar result also holds for polyhedral quasi-norms, i.e., asymmetric distance functions that 
can be defined in terms of "unit balls" that are non-symmetric polyhedra containing the origin. 
These can again be characterized by a set of vectors Hp, but now we need a vector for each facet of 
the polyhedron and d(x,y) = max{(x — y) ■ h : h G Hp}. Serdyukov [2H HH] previously showed 
that for the case of a quasi-norm in M? with a triangle as the unit ball, the Maximum TSP can be 
solved in polynomial time. With our techniques we can show the following analog of Theorem^ 

Theorem 2 Let dimension d be fixed, and let \\ ■ \\ be a fixed polyhedral quasi-norm in M. d whose 
unit ball is a polyhedron P determined by a set of f facets. Then for any set of n points in 
M. d , one can construct a traveling salesman tour of maximum length with respect to \\ ■ \\ in time 
0(n 2 f~ 2 log n) in the real number RAM model. 



3 



As an immediate consequence of Theorem^ we get relatively efficient algorithms for the Max- 
imum TSP in the plane under Rectilinear and Sup norms, with a complexity of 0(n 2 logn). 

The restriction to the real number RAM model in Theorem ^ is made primarily to simplify the 
statements of the conclusions. Suppose on the other hand that one assumes, as one typically must 
for complexity theory results, that the components of the vectors in Hp and the coordinates of the 
cities are all rationals. Let U denote the maximum absolute value of any of the corresponding nu- 
merators and denominators. Then the conclusions of the Theorem hold for the standard logarithmic 
cost RAM model with running times multiplied by n\og{U). If the components/coordinates are all 
integers with maximum absolute value U, the running times need only be multiplied by log(nU). 
For simplicity in the remainder of this paper, we shall stick to the model in which numbers can 
be arbitrary reals and arithmetic operations take unit time. The reader should have no trouble 
deriving the above variants. 

The above results make use of a polynomial solution method for the following TSP variant, 
which may be of independent interest: When visiting a given set of n cities, all connections have 
to be made via a set of k hubs, where k is a constant. 

The complexity of 0(n 2 log n) for the scenario of planar rectilinear distances can be improved 
by using special geometric properties of this case. We can show the following optimal running time: 

Theorem 3 The Maximum TSP for points in M? under the L\ norm can be solved in 0(n) time. 

By appropriate coordinate transformation, this result can be generalized to all planar polyhedral 
norms with / = 4 facets, which includes the Sup norm. It holds even in a restricted model of 
computation, where no indirect addressing may be used, and hence sorting requires f2(ralogn) 
time. The main idea behind the algorithm is to exploit the fact that rectilinear distances in the 
plane have a high degree of degeneracy. As a consequence, we can show that the number of optimal 
tours is very large, 0((^!) 4 ), and the set of optimal tours can be described very easily. This 
contrasts sharply to the case of Euclidean distances, where there may be a single optimal tour. 

Indeed the complexity of the Maximum TSP in IR 2 under the Euclidean metric remains an open 
question, although we have resolved the question for higher dimensions with the following result. 

Theorem 4 Maximum TSP under Euclidean distances in M. d is an NP-hard problem if d > 3. 

One of the consequences is NP-hardness of the Maximum TSP for polyhedral norms with an 
unbounded number of facets on the corresponding unit ball. Another consequence concerns the 
so-called Maximum Scatter TSP, where the objective is to find a tour that maximizes the shortest 
edge. The Maximum Scatter TSP was first considered by Arkin, Chiang, Mitchell, Skiena, and 
Yang and the complexity for geometric instances was stated as an open problem. Our result 
implies NP-hardness for Euclidean instances in 3-dimensional space. 

An issue that is still unresolved for the Maximum TSP as well as the Minimum TSP is the 
question of whether the TSP under Euclidean distances is a member of the class NP, allowing 
polynomial time verification of a good solution. Even if all city coordinates are rationals, we do not 
know how to compare a tour length to a given rational target in less than exponential time. Such 
a comparison would appear to require us to evaluate a sum of n square roots to some precision, 
and currently the best upper bound known on the number of bits of precision needed to insure a 
correct answer remains exponential in n. 

The rest of this paper is organized as follows. Section [2] introduces a new special case of the TSP, 
the Tunneling TSP. We show how the Maximum TSP under a polyhedral norm can be reduced to 
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the Tunneling TSP with the same number of cities and k = f/2 tunnels (and with k = f tunnels 
for polyhedral quasi-norms). Section 13.11 shows how the solutions for the Tunneling TSP with a 
fixed number k > 2 of tunnels can be characterized, setting up an algorithm with a running time of 
0(n 2k ~ 2 log n) described in detail in Section [3. 21 Section 0] describes the linear-time algorithm for 
rectilinear distances in the plane. Section [5] gives the NP-hardness proof of the Maximum Traveling 
Salesman Problem for Euclidean distances in IR 3 , and a number of higher-dimensional extensions. 
Section El concludes with a brief discussion and open problems. 



2 The Tunneling TSP 

The Tunneling TSP is a special case of the Maximum TSP in which distances are determined by 
what we shall call a tunnel system distance function. In such a distance function we are given a set 
T = {t\,t2, ■ ■ ■ , tfe} of k > 2 auxiliary objects that we shall call tunnels. Each tunnel is viewed as 
a bidirectional passage having a front and a back end. For each pair c, t of a city and a tunnel we 
are given real-valued access distances F(c,t) and B(c,t) from the city to the front and back ends 
of the tunnel respectively. Each potential tour edge {c, c'} must pass through some tunnel t, either 
by entering the front end and leaving the back (for a distance of F(c,t) + B(c',t)), or by entering 
the back end and leaving the front (for a distance of B(c,t) + F(c',t)). Since we are looking for a 
tour of maximum length, we can thus define the distance between cities c and d to be 



d(c, c') = max |f(c, U) + B(c', U), B(c, U) + F(c', U) : 1 < i < £;} 



Note that this distance function, like our geometric norms, is symmetric. (As we will see below, we 
can make adjustments for asymmetric distance functions.) 

It is easy to see that Maximum TSP remains NP-hard when distances are determined by 
arbitrary tunnel system distance functions. However, for the case where k = \T\ is fixed and 
not part of the input, we will show in the next section that Maximum TSP can be solved in 
0{n 2k ~ 2 logn) time. We are interested in this special case because of the following lemma. 

Lemma 5 If || • || is a polyhedral norm determined by a set Hp of k = f/2 vectors in M. d , then for 
any set C of points in R d one can in time 0(dk\C\) construct a tunnel system distance function 
with k tunnels that yields d(c, c') =\\ c — c' \\ for all c, c' £ C . 

Proof: The polyhedral distance between two cities || c, c' ||e M. d is 



c' II = max • 



c') • h, 



1< i < k 



} 



max |(c — c') • hi, (c' — c) • hj : 1 < i < k\ 



Thus we can view the distance function determined by || • || as a tunnel system distance function 
with set of tunnels T = Hp and F(c, h) = c ■ h, B(c, h) = —c ■ h for all cities c and tunnels h. □ 

It is straightforward to extend this characterization to the case of any polyhedral quasi-norm 
that is characterized by a unit ball with a total of / facets. The only real change is in the complexity 
of the characterization of the tunnel system in Lemma |SJ If we use the definition 

d(c, c') = max |f(c, U) + B(c', U) : 1 < i < £; j 

for possibly asymmetric tunnel distances, then by similar reasoning we get the following: 
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Lemma 6 If \\ ■ \\ is a polyhedral quasi-norm determined by a set Hp of f vectors in M. d , then for 
any set C of points in M. d one can in time 0(df\C\) construct a tunnel system distance function 
with f tunnels that yields d(c, c') =|| c — c' \\ for all c, c' £ C . 
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3 An Algorithm for Bounded Tunnel Systems 



In this section we describe an 0(n 2k ~ 2 log ri) algorithm to solve the Tunneling TSP when the number 
of tunnels is fixed at k, assuming the real number RAM model. By Lemmas and El this implies 
our results for polyhedral norms and quasi-norms (Theorems ^ an d El) The approach described 
below also yields a polynomial algorithm for a Minimum TSP variant, where a given set of n cities 
has to be traveled, and all connections have to be made via a set of a fixed number k of hubs. 

We start by characterizing solutions for bounded tunnel systems in Section \3. II This character- 
ization is the basis for our algorithm, which is described in Section 1.3.21 In Section 13.31 we present 
an additional idea that may possibly improve the above complexity to 0{n 2k ~ 2 ). 

3.1 Characterizing Solutions for Bounded Tunnel Systems 

We start by discussing the solutions for bounded tunnel systems. 

Suppose we are given an instance of the Tunneling TSP with sets C = {ci, . . . , c n } and T = 
{ti, . . . ,tk} of cities and tunnels, and access distances F(c,t), B(c,t) for all c G C and t G T. We 
begin by transforming the problem to one about subset construction. 

Let G = (C U T, E) be an edge-weighted, bipartite multigraph with four edges between each 
city c and tunnel t, denoted by ei[c,t,X], i G {1,2} and X £ {B,F}. The weights of these edges 
are w(ei[c,t, F]) = F(c,t) and w(ei[c,t,B]) = B(c,t), i G {1,2}. For notational convenience, let us 
partition the edges in E into sets E[t, F] = {et[c, t,F] : c G C,i G {1, 2}} and E[t, B] = {ei\c, t, B] : 
c G C, i G {1,2}}, t G T. Each tour for the TSP instance then corresponds to a subset E' of E that 
has Yle&E' w ( e ) equal to the tour length and satisfies 

(Tl) Every city is incident to exactly two edges in E' . 
(T2) For each tunnel t G T, \E'nE[t,F}\ = \E' n E[t, B]\. 
(T3) The set E' is connected. 

To construct the multiset E' , we simply represent each tour edge {c, c'} by a pair of edges from 
E that connect in the appropriate way to the tunnel that determines d(c,d). For example, if 
d(c, c') = F{c, t) + B(c' , t), and c appears immediately before d when the tour is traversed starting 
from c^m, then the edge (c, d) can be represented by the two edges e2[c, t, F] and e±[d, t, B]. Note 
that there are enough (citytunnel) edges of each type so that all tour edges can be represented, 
even if a given city uses the same tunnel endpoint for both its tour edges. Also note that if d(c, d) 
can be realized in more than one way, the multiset E' will not be unique. However, any multiset 
E' constructed in this fashion will still have Yle&E 1 w ( e ) equal to the tour length. 

On the other hand, any set E' satisfying (Tl) - (T3) corresponds to one (or more) tours having 
length at least Y^eeE' w ( e ) : Let T' C T be the set of tunnels t with \E' n E[t, F]\ > 0. Then 
G' = (C U T',E') is a connected graph all of whose vertex degrees are even by (Tl) - (T3). 
By an easy result from graph theory, this means that G' contains an Euler tour that by (Tl) 
passes through each city exactly once, thus inducing a TSP tour for C. Moreover, by (T2) one 
can construct such an Euler tour with the additional property that if ej[c, t, x] and ej[d,t,y] are 
consecutive edges in this tour, then x ^ y, i.e, either x = F, y = B or x = B, y = F. Thus we will 
have w(ei[c,t,x]) + w(ej[d, t, y]) < d(c,d), and hence the length of the TSP tour will be at least 

X^eG-E' w ( e )' as claimed. 

In summary, our problem is reduced to finding a maximum weight set of edges E' C E satisfying 
(Tl) - (T3). 
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3.2 An Efficient Algorithm 



3.2.1 Identifiers for Subproblems 

From the previous section we know that our problem is reduced to finding a maximum weight set of 
edges E' C E satisfying (Tl) - (T3). Given that k is fixed, we will first show how to decompose the 
problem into 0(n 2fe ~ 3 ) instances, and then demonstrate how to solve each one of them in 0(n log n) 
time. 

Let E' be a subset of edges satisfying (Tl) - (T3). We associate five identifiers with E' . 

The first is T' C T, the subset of p, p < k, tunnels that are spanned by E'. (Without loss of 
generality suppose that T 1 = {t\, ...,t p }.) Globally, the total number of identifiers of the first type 
is 0(2 k ). 

Due to conditions (Tl), (T3) we know that there is a subset of 2(p — 1) edges E" C E' such 
that the subgraph G(E"), induced by E", is connected and it spans T' and exactly p — 1 cities. 
Moreover, each of these cities is incident to exactly two edges in E" , which connects two adjacent 
tunnels in T". We use the second, the third and the fourth identifiers to characterize G(E"). 

The second identifier is the spanning tree topology connecting the tunnels of T" that is induced 
by the set of edges E" . There are pP -2 identifiers of this type: the number of labeled spanning 
trees of K p , the complete graph on p nodes [Hj. 

The third identifier is an assignment of distinct cities to the spanning tree edges of the previous 
identifier. The city assigned to the tree edge joining tunnels t and t' is one that is connected by 
one edge of E" to t and by another edge of E" to t! . The total number of identifiers of the third 
type is 0{n v ~ l ). 

The fourth identifier is the subset of 2{p — 1) edges linking these cities with corresponding tunnel 
entrances. In the underlying graph G there are four edges (two pairs of identical edges) connecting 
a city with a tunnel. Therefore, the total number of identifiers of the fourth type is 0(16 p_1 ). 

The fifth and last identifier is the degree sequence of the nodes in T' induced by E' . Let 
d = (2di, 2d p ) denote this sequence of even positive degrees. Note that d\ + ... + d p = n. 
Therefore, there are clearly 0(n p ~ 1 ) identifiers of this third type. However, we will modify the 
identifier and use one of the degree entries, say d p , as an unspecified parameter. Thus, there are 
only 0{n p ~ 2 ) choices for, say, d\, cfo, d p -2- (dp-i will then depend linearly on the parameter dp.) 
Altogether, we now have 0(n 2k ~ 3 ) choices for values of identifiers. 

In summary, to prove our claim that the Tunneling TSP can be solved in 0(n 2k ~ 2 log n) time 
when k is fixed, it will suffice to show that the following problem can be solved in 0(n log n) time. 

Given a set of tunnels T' = {t±, t p }, a set of p— 1 cities, a set E" of 2(p — 1) edges connecting 
the cities to the tunnels as above, and a set of positive integers d±, d p -2, find a maximum 
weight set of edges E' , containing E" , satisfying (T1)-(T2), and such that the degree of ti is 2di, 
\<i<p-2. 

3.2.2 Solving the Subproblems 

Without loss of generality assume that the p — 1 cities connecting the tunnels in T' are {c\ , c p _i}. 
Let C = {ci, c p _i}, and C" = C — C . For each i = 1, ...,p, we split the tunnel ti into two nodes, 
tf and tf . They are called, respectively, B and F tunnel entrances. Define T' B = {tf , ...,£?} and 
T' F = {tf , t^}. 

Let G' = (C" U (Tg U Tp),E*) be an edge- weighted, bipartite multigraph with two edges, each 
of weight B(cj,ti), connecting Cj and tf, j = p,p + l,...,n, i = l,...,p, and two edges, each of 
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weight F(cj,ti), connecting Cj and tf, j =p,p+ l,...,n, i = I,..., p. 

Next, using the notation from the previous subsection, for each i = l,...,p, let /• = \E" n 
E[U,B]\ and ff = \E" n E[t h F]\. (Note that ff (ff) is the number of "back" ("front") type 
edges in E" that are incident to tunnel ij.) 

Following is a formal description of the maximization problem: 

p n p n 

g(d p ) = max(^ ^ B( Cj ,ti)xfj + ^ F ( c ^ t i) x f,j) 

8=1 j=p i=\ j=p 

subject to 

n 

Y,xf J = d i -f l B , i = l,...,p-2,p 

j=p 

n 

^2 x f,j = di - ff , i = l,...,p-2,p 



n p—2 

^2 x p-i,3 = n ~ dp — fp-i — ^2 di, 

j=p i=l 
n p—2 

x p~ 1 J = n ~ dp — fp-i — ^2 dii 

j=p i=l 
P 



^2{xfj + xfj) = 2, j = p, n, 
i=l 

xfj > 0, i = l,...,p, j =p,...,n, 
xfj > 0, i = j =p,...,n. 

It is easy to see that for each integer value d p £ {l,2,...,n}, the above problem is an instance 
of the classical transportation problem with n — p + 1 sources (cities) and 2p destinations ( j) B 
tunnels and p F tunnels). Therefore there is an optimal solution to the linear program for each 
integer value of d p in which all variables are integer. Moreover, since the number of destinations is 
fixed (p < k), when d p is specified the dual of this transportation problem can be solved in 0(n) 
time by the algorithm in Zemel |33j . (See also Megiddo and Tamir .22..) In particular, g(d p ) can 
be computed by solving the dual in 0(n) time. Viewing d p as a (real) parameter, we note that 
g(dp), the optimal objective value of the above transportation problem, is a concave function of d p . 
To see this, observe that if d < d' are legal values for d p , and Xd and x~d' are the optimal solutions 
for these two values viewed as vectors of variable values, then (xd + Xd')/2 is a feasible solution for 
{d + d')/2 and so g((d + d')/2) > (g(d) + g{d'))/2. Thus, we can apply a Fibonacci search over the 
integers {l,2,...,n}, or alternatively perform a binary search, at each step checking the values at 
d—1, d, and d+l. (Actually, d p is restricted by 1 < d p < n — Yli=i di-) Specifically, by computing 
the function g(d p ) at O(logn) values of d p we obtain the (integer) value of d p maximizing g(d p ). 
Thus, in 0(n log n) time we find the best value of d p . 

Therefore, in time 0(n 2fc ~ 3 • nlogn) = 0(n 2k ~ 2 log n) we can determine the set of identifiers 
for the transportation problem that yields the maximum solution value (including the full degree 



9 



sequence for the tunnels). This solution value will be the length of the maximum length tour, but 
because we were solving the dual rather than primal LP's, we won't yet have the optimal tour itself. 
For this, we need only solve the optimal transportation problem directly in its primal form. Since 
the number of tunnels is bounded, we can do this in deterministic time 0(n) using an algorithm 
described in an unpublished paper by Matsui The basic idea is to apply complementary 

slackness properties to the already computed dual solution variables to reduce the problem to a 
network flow problem for a bounded number of sinks. This can then be solved in linear time using 
an algorithm of of Gusfield, Martel and Fernandez-Baca (see also Ahuja, Orlin, Stein and 
Tarjan pQ). Alternatively, one can use the published algorithm of Tokuyama and Nakano [3X] that 
requires time 0(n log 2 n) time. In either case, the time to find this one primal solution is dominated 
by the time already spent to solve all the duals, so the overall running time bound is that for the 
latter: 0(n 2k ~ 2 log n). 

3.3 Further Improvement 

One way to improve upon the above bound is to reduce the running time to solve an instance of 
the parametric transportation problem from O(nlogn) to 0(n). At this point we do not know how 
to achieve the linear bound, but we feel that the following approach might be fruitful. 

Consider the above parametric transportation problem, and view the parameter d p as an addi- 
tional (real) variable. The resulting linear program, which we call the primal, is not a transportation 
problem. Nevertheless, the algorithms in Zemel [33] and Megiddo and Tamir [22] can still solve the 
dual of this linear program in 0(n) time. The missing ingredient at this point is how to use the 
dual solution to obtain (in linear time), an optimal solution to the primal program. Specifically, 
we only need to know the optimal (real) value of the primal variable d p , say d*. If d p is known, we 
can use the concavity property of the function g{d p ) to conclude that the optimal integer value of 
d p is attained by rounding d* up or down. We can then proceed as above. 

4 An 0(n) Algorithm 

In this section, we describe a linear-time algorithm for determining the length of an optimal tour for 
the Maximum TSP under rectilinear distances in the plane. This amounts to a proof of Theorem 
structured into a series of Lemmas and stretching over Subsections 14.11 to 14.31 At the end of the 
section we sketch how this result can be extended to any 4-facet polyhedral metric. Throughout 
this section we assume without loss of generality that n > 4, as otherwise the problem is trivial. 

4.1 Stars and Matchings 

Our construction uses properties of so-called stars; a star for a given set of vertices V is a minimum 
Steiner tree with precisely one Steiner point (the center) that contains all vertices in V as leafs. 
The total length of the edges in a star is an upper bound on any matching in V, since any edge 
(vi,Vj) in the matching can be mapped to a pair of edges (vi,c) and (c,Vj) in the star, and by 
triangle inequality, d{vi,Vj) < d((vi,c) + d(c,Vj). The worst case ratio between the total length 
of a minimum star min5(P) and a maximum matching maxM(P) plays a crucial role in several 
different types of optimization problems. See the paper by Fingerhut, Suri, and Turner |14j for 
applications in the context of broadband communication networks. Also, Tamir and Mitchell 30 
have used the duality between minimum stars and maximum matchings for showing that certain 
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matching games have a nonempty core. The value of the worst case ratio under the Euclidean 
metric has been determined by Fekete and Meijer; see for this result and several extensions. 

In the rest of this section, all distances are planar rectilinear distances, unless noted otherwise 
at the end of the section. For rectilinear distances, determining the length minS*(P) of a minimum 
length star (also known as the rectilinear planar unweighted 1-median or rectilinear Fermat-Weber 
problem) can be done in linear time. 

Lemma 7 Suppose we are given a given set of n points P = {pi, . . . ,p n } with p, = (xi,yi), 
1 < i < n. Then under L\ distances there is an optimal star center c = (x c ,y c ), where x c is a 
median of {xi \ i = 1, . . . , n}, and y c is a median of {yi \ i = 1, . . . ,n} . 

This is problem 9-2e in |1U| . Recall that if n = 2m is even, then both the m'th and (m + l)'st 
largest items are medians.) The proof is immediate; for an 0(n) running time, use the result by 
Blum, Floyd, Pratt, Rivest, and Tarjan for computing a median of n numbers. 

It should be noted that for Euclidean distances, the problem of determining minS'(P) is con- 
siderably harder: It was shown by Bajaj 5 that the problem of finding an optimal star center for 
five points in the plane is in general not solvable by radicals over the field of rationals. This implies 
that an algorithm for computing min S(P) must use stronger tools than constructions by straight 
edge and compass. 




Figure 1: The four quadrants and their point sets. 

The following Theorem |H] appears in the paper by Tamir and Mitchell |3U| as Theorem 8; 
independently, it was noted by Fekete and Meiier|T5]. 

Theorem 8 For \P\ even, we have maxM(P) = minS'(P) for rectilinear distances in the plane. 

The basic idea is that the coordinates of an optimal star center subdivide the plane into four 
quadrants. If ties are broken in the right way, the number of points in opposite quadrants is the 
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same. See Figure ^ Then 

Li(vi,Vj) = Li(«i,c) +Li(c,Vj) (1) 

holds for any edge (vi,Vj) in the matching, and the theorem follows easily. □ 

Since the results of this paper include the case in which |P| is odd, we formalize and generalize 
this observation. 

Let x c and y c be as in Lemma [7] and define P~ := {pi £ P \ Xi < x c }, := {xi \ xi = x c }, 
p x '■= {Pi £ P\xi> x c }, Py := {pi e P\yi < y c }, P° := {yi \ y t = y c }, and P+ := {p { G P \ y { > 
y c }, From the above conditions, it follows that 

Tl 

n~: = l^~l<2 ( 2 ) 
nt- = l^ + l<^ (3) 

Tl 

= \Py\<2 (4) 



P v + \ < (5) 



v ■ ' y i - 2 



Let n° x := \P®\, and n° := |P° |. By picking any subset of P® of size [~^] — n~ and joining it with 

P~, we get a set P x ^° of size [~^~|; the remaining [§J points form the set Px^ + ■ Similarly, we 

get the partition into P y ^° of size [f], and Py l+ of size [§J • Define the following quadrant sets: 

P- := P- /0 n P y /0 , P-+ := P x - /0 n P° /+ , P+~ := P x ° /+ fl P y /0 , and P++ := P° /+ fl P° /+ . The 
sets P and P ++ are opposite quadrant sets, as are P ^ and P" 1 . Two quadrant sets that are 
not opposite are called adjacent. Finally, let n := |P |, etc. We get the following conditions: 

Lemma 9 If n is even, then opposite quadrant sets contain the same number of points, i. e., 

n = n ++ (6) 
n~ + = n + ~. (7) 

If n is odd, the numbers of points must satisfy 

n" = n ++ + l (8) 
n~ + = n + ~. (9) 

Proof: From the definition of the quadrant sets, it follows for even n that 

n + n ~ + =n + ~+n ++ (10) 
n + = n + n ++ . (11) 

From this the claims © and (J7J) follow easily, as was noted in |3Uj and |13| . 
For the odd case, the definition of the quadrant sets yields the conditions 

n + n~+ = n + ~+n ++ + l (12) 
n + n + ~ = n~ + + n ++ + l. (13) 

This implies © and ©. □ 

In the following, we will use Lemma to derive first an optimal 2-factor, consisting of at most 
two subtours, and then argue how these subtours can be merged optimally. 
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4.2 2-Factors and Trivial Tours 



A 2-factor for a set of vertices is a multi-set of edges that covers each vertex exactly twice. Since 
any tour is a 2-factor, a maximal length 2-factor is an upper bound for the length of a tour. Using 
the triangle inequality, it is straightforward to see that twice the length of a star is an upper bound 
for the length of any 2-factor, even when the star is centered at one of the given vertices. Achieving 
tightness for this bound is the main stepping stone for our algorithm. Based on the results of the 
preceding section, we prove the following three lemmas. We start with the easiest case: 

Lemma 10 If two of the quadrant sets are empty, then there is a feasible tour of length 2mmS(P), 
which is optimal. 

Proof: Note that the conclusion will follow if we can construct a tour in which all edges satisfy 
property Q above. If two of the quadrant sets are empty, it follows from LemmaElthat these must 
be opposite. Without loss of generality, let us assume that they are P~~ and P +,+ . For the other 
two sets, any edge (vi,Vj) between opposite quadrant sets satisfies property (0). If the number of 
points in the two opposite quadrant sets is the same, we can get a tour by jumping back and forth 
while there are unvisited points in these quadrant sets. If n is odd and p* = (x c ,y c ) E P , then 
can be inserted into any of these edges, while ® will still apply to all edges. If n is odd and 
P* = (xc,Uc) P , then P must contain two points, one with x c as its x-coordinate and one 
with y c as its y-coordinate. The tour connects these two together and then jumps back and forth 
between quadrants. All edge lengths again satisfy (0). □ 

Lemma 11 Suppose no quadrant set is empty, n is odd and \P®UPy | > 1. Then there is a feasible 
tour of length 2min S(P), which is optimal. 

By conditions |2] and 01 and because x c and y c are both coordinates of points, we know that 
P® n P x ^° and Py n P y ^° each must contain a point. As |P° U P°| > 1, we can consider two points 
p a € P°, Pb € Py with a ^ b. Let us assume without loss of generality that when we constructed 

P x ^° and P y ^° we assigned p a to the former and pb to the latter. 
We distinguish the following cases - see Figure [21 

(a) p a ,p b e P : 

By connecting p a with a point in P , and pb with a point in P" 1 , and otherwise jumping 
back and forth between opposite quadrant sets, we get a tour that satisfies (0) for any edge. 

(61) Pa <£ P—, Pb G P— : 

In this case, p a £ P <r . By changing the membership of pb from P to P '", we get |P | = 
|P ++ | and |P h | = |P H | + 1. Then a tour for the modified P *~ and P" 1 can be obtained as in 
case (a). 

(62) p a e P—,p b £P—: 

This is treated in the same way as case (bl). 

(c) Pa,Pb £ P : 

In this case, p a £ P ^ and pb G P^ . By changing the membership of p a from P to P ++ 
and the membership of pb from P^ to P ++ , we get |P ++ | = |P | + 1 and |P ^| = |P^ |, so we 
can get tours as in case (a). □ 
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(a) (b) (c) 

Figure 2: Getting optimal 2-factors. 

In the remaining cases we may no longer be able to get a tour that meets the 2min5*(P) upper 
bound, but the next Lemma says that we can construct two disjoint tours whose total length meets 
this bound. We will subsequently show how these tours can be combined with only a small decrease 
in total length to obtain a single optimal tour. 

Lemma 12 Suppose no quadrant is empty and either n is even or n is odd and \P° U P°| = 1. 

Then there is a tour T / ++ of the points in P U P ++ , and a tour T ^/^ of the points in 

p-+\jP + ~, such that £(T__ /++ ) + £(T_ +/+ _) = 2mmS(P). 

Proof: If n is even, we can argue like in the proof of Lemma lTUl We get two subtours, one covering 
each pair of opposite quadrant sets. 

If n is odd and there is only one point p* in P® U P® , the case reduces to n even, since 
P* = (c x ,c y ) £ P , and can be inserted into any tour of P \ {p*} and P ++ while still 
guaranteeing (^Q) for any tour edge. □ 

4.3 How to Merge 2-Factors 

Suppose that no quadrant set is empty and that we have a pair of subtours, whose total length 
matches the 2minS'(P) upper bound on optimal tour length, as in Lemma IT2l Now we shall show 
how the upper bound has to be adjusted if we are to restrict ourselves to connected tours, and how 
the adjusted bound can be met. 

We start with the easier case of n odd and the median being part of the point set, before dealing 
with the more complicated case of even n. 

4.3.1 Odd n 

Let n be odd and \P® U P®\ = 1, which means that p* = (x c , y c ) is in P. 

Lemma 13 Let n be odd, \P® U P®\ = 1, and all quadrant sets be nonempty. Then any tour of P 
contains an edge that connects two adjacent quadrant sets and is not incident on p* . 
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Proof: Any tour T of P induces a tour T' on the set P\{p*}; T' must contain at least two different 
edges ei and e2 that connect adjacent quadrant sets, i. e., that connect S\ = (P \ {p*} U 
to S2 = (P h U P -1 ) = P \ S%. One of these two edges must also be part of T, and the claim 
follows. □ 

Lemma 14 Let e\ = (pi,P2) be an edge connecting two horizontally (or vertically) adjacent quad- 
rant sets. Let pi = (xi,yi), and define z := min{ | y c — yi \, \ Vc — y2 I } (or z := mm{\x c —xi\, \x c —X2\})- 
Then any tour containing e\ has length at most 2minS'(P) — 2z. 

Proof: In either case, we have L\(jp\,p2) = Li(pi, c) + L\(c,p2) — 2z, and the claim follows. □ 

By considering all possible edges (pi,p2) connecting adjacent quadrant sets, we get an adjusted 
upper bound on the tour length. Note that only the smaller distance from a median line matters 
for this bound. More formally, let Z% = min{|y c — yi\ \ pi £ P \ {p*}}, and Z2 = min{|x c — Xi\ \ 
Pi S P\ {p*}}, and let Z* = min{Zi, Z 2 }. 

Lemma 15 Let n be odd, \P®L)Py \ = 1, and all quadrant sets be nonempty. Then an optimal tour 
of P has length min S(P) — 2Z*, and such a tour can be found in linear time. 

Proof: By Lemma ^] and Lemma ITU min5'(P) — 2Z* is a valid upper bound on the tour length. 
It follows from the discussion of S(P) and the definition of Z* that the bound can be computed in 
linear time. Finally suppose for example that = Z\ and that p\ £ P is a point for which the 
value of Z\ is met. Connect p\ to a vertex in P" 1 , and p* to vertics in P ++ and P . As shown 
in Figure El it is straightforward to add only edges between opposite quadrants in order to get a 
tour of the required length. Other cases are handled analogously. □ 




Figure 3: Getting an optimal tour for odd n, when the median is in P. 



4.3.2 Even n 

The proof proceeds similarly to the case for odd n but requires a more involved combinatorial 
analysis. Let us say that a pair of edges e\ = (^1,^2) and e2 = (^3,^4) is a quadrant matching of 
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the first type if v\ G P , v 2 G P~ + , v 3 G P + ~, t> 4 G P ++ or if G P , W2 G P + ~ , w 3 G P~ + , 
t>4 G P ++ . See Figure 4. We will call ei and e 2 a quadrant matching of the second type if one edge 
joins adjacent quadrants and the other lies within a third quadrant. 




(b) (c) 

Figure 4: (al+2) Quadrant matchings of the first and second type, (b+c) Edges connecting adjacent 
quadrant sets. 

Lemma 16 Let n be even and all quadrant sets be nonempty. Then any tour of P contains a 
quadrant matching of either the first or the second type. 

Proof: As in the proof of Lemma IT^l any tour of P must contain at least two different edges e% = 
(vi,V2) and e 2 = («3,U4) that connect adjacent quadrant sets, i. e., that connect Si = (P UP ++ ) 
to S 2 = (P-+UP+-) =P\S t . 

We consider the following cases; in all cases, we name particular choices of quadrants for clearer 
notation and better reference to Figure 21 All other choices are completely analogous. Moreover, 
all arguments remain valid if two of the named vertices coincide. 

(a) (1)1,1)2), (t>3 5 i>4) form a quadrant matching of either type: 

In this case, there is nothing to prove. 

(b) 1)2)5 ( v 3i v 4:) connect one quadrant with both adjacent quadrants: 

Suppose without loss of generality Vi,v 3 G P , v 2 G P^ , V4 G P h , as shown in the figure. 
Since two edges adjacent to vertices in P are already given, there can be at most 2n — 2 = 
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2n ++ — 2 edges between P and P ++ , so there must be an edge connecting two vertices in P ++ , or 
two edges connecting P ++ to adjacent quadrant sets. In the first case we have a quadrant matching 
of the second type and in the second case we have a quadrant matching of the first type, so the 
claim follows. 

(c) (^1,^2)? (^3,^4) connect the same pair of adjacent quadrants: 

Suppose without loss of generality vi,v% G P , v 2 ,V4 G P + ~ . Since two edges adjacent to 
vertices in P -1 are already given, there can be at most 2n H — 2 = 2n h — 2 edges between P 
and P H , so there must be an edge connecting two vertices in P h (and we are done), or there 
must be two edges between P~ + and adjacent quadrant sets, either yielding a quadrant matching 
of the first type (and we are done), or reducing this case to case (b). □ 

Now we can give an upper bound on the length of an optimal tour with a given pair of edges. 

Lemma 17 Let e\ = (pi,p 2 ) be an edge connecting two adjacent quadrant sets, say, P and 
P H . Let e2 = (P3,P4,) be an edge forming a quadrant matching with e±. Let pi = (xi,y.i), and 
define z x : = min{(y c - y ± ), (y c - y 2 )}, z 2 := min{(y 3 - y c ), (y 4 - y c )}. 

Then any tour containing e\ and e 2 has length at most 2min5(P) — 2z\ — 2z 2 - 

Proof: Since L 1 (p 1 ,p 2 ) = L 1 (p 1 ,c) +L 1 (c,p 2 ) - 2zi, and L 1 (p 3 ,p /L ) < L 1 (p 3 ,c) + Li(c,p 4 ) - 2z 2 , 
the claim follows. □ 

By considering all pairs of edges, we get an adjusted upper bound on the tour length. For this 
purpose, let Z\ = min{|y c — yi\ | Pi £ P U P^ }, and let Z 2 = min{|yj — y c \ \ pi G P ^ U P ++ }. 
Similarly, let Z3 = min{|2; c — Xi\ \ pi £ P UP *~}, and let Z4 = min{|xj — x c \ \ pi G P^ UP ++ }. 
Finally, let = min{Zi + Z2, Z3 + Z4}. 

Lemma 18 Let n be even, and all quadrant sets be nonempty. Then an optimal tour of P has 
length min S(P) — 2Z*, and such a tour can be found in linear time. 




Figure 5: Getting an optimal tour for even n. 
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Proof: It follows immediately from Lemmas 1161 1171 and the definition of Z* that minS'(P) — 2Z* 
is a valid upper bound that can be computed in linear time. To see that there is a tour of this 
length, consider a pair of vertices where the value Z* is met. Without loss of generality, let this be 
for pi E P and P2 G P ++ . Connect pi to any vertex in P" 1 , and P2 to any vertex in P ^ . Now 
it is easy to see that using only edges connecting opposite quadrant sets, we can get a tour. See 
Figure [3 □ 

This concludes the proof of Theorem |21 

4.4 An extension 

Using a rotation by vr/4, it is easy to transform distances to L\ distances, so the theorem 
remains valid for this case. Moreover, any 4-facet polyhedral metric can be transformed into the 
L\ metric with an appropriate coordinate transformation, turning the unit ball into a square. See 
Figure El for an illustration. All arguments described in the preceding section can still be applied 
for these transformed coordinates, so the following generalization holds: 

Theorem 19 For any 4-facet polyhedral metric in the plane, a tour of maximum possible length 
can be constructed in linear time. 




Figure 6: Transforming an arbitrary 4-facet norm into the L\ norm. 

We conclude this section by noting that TheoremEldoes not appear to extend to two-dimensional 
metrics with more than four facets, the two-dimensional Euclidean metric, or the rectilinear metric 
in higher dimensions. 
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To see that there is no easy generalization even for L\ distances in higher dimensions, note that 
the partition into orthants by an optimal star center may not induce a "balanced" partition of the 
point set, such that we have subsets of equal size in opposite orthants. 

Example 20 Consider P with points in each of the orthants {q = (x,y, z) \ x > 0, y > 0, z > 
°}> U = (x,y,z) | x < 0,y < 0,z > 0}, {q = (x, y, z) \ x < 0, y > 0, z < 0}, {q = (x,y,z) | 
x > 0,y < 0, z < 0}, plus the point (0,0,0). Then (0,0,0) is the unique optimal star center. No 
connection of points in different orthants keeps the triangle inequality tight. 

That there is a fundamental difference between the Euclidean and rectilinear metrics for the 
plane is clear from the following. 

Corollary 21 For any set of n points in the plane, there are Q. ((^!) 4 ) man V tours which are 
optimal for the Maximum TSP under rectilinear distances. If the distances are Euclidean, there 
may only be one optimal tour. 

Proof: Any tour that can be constructed as in Lemma ITU1 Lemma IT51 or Lemma fTHl is optimal, so 
we can choose an arbitrary permutation for each quadrant set. This yields the above lower bound 
on the number of optimal tours. Conversely, we see that any optimal tour must have the structure 
described in the Lemmas. 

To see that there may only be one optimal tour for Euclidean distances, consider a set of 
n = 2k + 1 points that are evenly distributed around a unit circle. □ 

5 NP-Hardness Results 

Now we proceed to show that changing from polyhedral to Euclidean distances, or to distances in 
spaces of unbounded dimension, changes the problem complexity from polynomial (or even linear) to 
NP-hard. This dramatic effect illustrates that failing to model geometric instances of optimization 
problems beyond their combinatorial graph structure can miss out on important differences in 
problem complexity. 

5.1 Euclidean Distances in 3D 

In this section, we establish the NP-hardness of the Maximum TSP under Euclidean distances in 
M. d . The proof gives a reduction of the well-known problem Hamilton Cycle in Grid Graphs, which 
was shown to be NP-complete by Itai, Papadimitriou, and Swarcfiter PJi]]. A grid graph G is given 
by a finite set of vertices V = {v\ V2, ■ ■ ■ , v n }, with each vertex Vi represented by a grid point 
(xi,yi) £ Z 2 ; for easier notation, we write V{ = (xi,yi). Two vertices Vi and Vj in G are adjacent 
if and only if they are at distance 1, i. e., if (xi — Xj) 2 + (yi — yj) 2 = 1. Without loss of generality, 
we may assume that G is connected, and that n is sufficiently large. Note that any grid graph is 
bipartite: vertices Vi with Xi + yi even can only be adjacent to vertices Vj with Xj + yj odd, and vice 
versa. In the following, we will denote this partition by V = V e L)V , where V e is the set of vertices 
with even coordinate sum, while V Q is the set of vertices with odd coordinate sum. 

The basic idea of the proof is to embed any grid graph G into the surface of a sphere in M 3 , 
such that edges in the grid graph correspond to longest distances within the point set. This can be 
achieved by representing the vertices in V e by points that are relatively close to each other around 
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a position (a, b, c) on the sphere, and the vertices in V by points close to each other at a position 
on the sphere that is roughly opposite (i.e., antipodal) to (a,b,c); for simplicity of description by 
spherical coordinates, we will use positions that are close to the equator. Locally, the mapping of 
the two point sets onto the sphere is an approximation of the relative position of vertices in the grid 
graph. Since adjacent vertices in a grid graph have different parity, unit edges in the grid graph 
representation correspond to edges connecting points that are almost at opposite positions on the 
sphere, and vice versa. 

In the following, the technical details are described. For simplicity, we use spherical coordinates 
and multiples of ir. However, it will become clear from our discussion that we only require compu- 
tations of bounded accuracy. It is straightforward to use only Cartesian coordinates that can be 
obtained by polynomial time approximation within the desired overall error bound of O (fi -8 ) for 
the length of an edge. 

Represent each vertex V{ by a point S(vi) on the unit sphere, described by spherical coordinates 
(r, <f>, 9), which translate into Cartesian coordinates by x = r cos ^ cos 9, y = r sin (p cos 9, z = r sin 9. 
Note that, as in standard geographic coordinates, the "equator" of the sphere is given by 9 = 0; 
the angle 9 describes the "latitude" of a point, while <j> describes the "longitude". Since we will 
only consider points with r = 1, we will simply write ((f>, 9) in spherical coordinates, but (x,y,z) 
in Cartesian coordinates. Let tp = ^j. Now any vertex V{ £ V e is represented by a point S(vi) = 
(xiip,yiip). Any vertex Vi G V Q is represented by a point S(vi) = (ir + Xiip, —yiip). 

Lemma 22 There is a small constant e n = O (n~ 8 ) , which can be computed in polynomial time, 
such that for the three-dimensional Euclidean distance L2 between two points S(vi) and S(vj), the 
relation L2 (S(vi), S(vj)) > 2 — ^ — e n holds if and only if vi and Vj are adjacent in G. In that 
case, L2 (S(vi), S(vj)) < 2 — ^- + e n . If Vi and Vj are not adjacent in G, then L2 (S(vi), S(vj)) < 

Proof: Since the diameter of the grid graph cannot exceed n, it is easy to see that we have 
L<2 (S(vi), S(vj)) < nip = O (n -2 ) whenever vi and Vj have the same parity. Therefore consider 
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Vi £ V e and G V^. Then 

[L a (5K),S(^))] 2 
= [L 2 ((cos(xiV) cos(yiV), sin(xiV) cos(y^), sin(y^)) , 

(cos(7r + ajji/') cos(— yjV"), sin(7r + XjVO cos(— yjip), sin(— yjVO))] 2 
= [cos(xjV0 cos(yiip) + cos(xjV) cos(yj^)] 2 + [sin(xj^) cos(yiV) + sin(xjV>) cos(yjV)] 2 

+ [sin(y;V0 + sin(yjV)] 2 



+ 0{{x^) A 



+ 1 



+ 0((x^) 4 ) 



2 

M) 2 



+ o((y^) 4 ) 
+ o(M0 4 



+ 



[xtf - O ((x^) 3 )) ( 1 - ^ + O ((y^f) 



M - O ((x^) 3 )) ( 1 - + O (M) 4 : 



n 2 



+ [yrt - O {{y^f) + yrf - O ((y,^) 3 )] 2 

(x,v0 2 M) 2 {x^f (yjip) 2 



+ 0{n- 8 ) 



H 2 



2 2 2 2 

+ [xiif> + Xjifi + O (n~ 6 )] 2 + 

+ [ yi if; + y^ + 0(n- 6 )] 2 + 
= [4 - 2(x^) 2 - 2(y^) 2 - 2(x,^) 2 - 2(y^) 2 + O (n" 8 )] 

+ [(x^) 2 + {Xji>) 2 + 2x i x i V 2 + O (n- 8 )] + [{y^f + (y^) 2 + 2y l2/j V 2 + O (n~ 8 )] 
= 4 - (xt - x,-) V - ( yi - yj ) V + O (n~ 8 ) . 

Since Vi and Vj have different parity, we have (xj — Xj) 2 ip 2 + (y, — yj) 2 ?p 2 = ip 2 if and Uj are 



adjacent in G, and (xj 
claim follows. □ 



/) ip + (yi — yj) ip > 5ip if Vi and Vj are not adjacent in G, so the 



From Lemma 1221 it is straightforward to conclude that there is a tour of length at least 2n — 
n% — ne n , if and only if the grid graph G is Hamiltonian. 

By setting additional coordinates equal to zero, we conclude the proof of Theorem^Jfor arbitrary 
dimensions d > 3. 



5.2 Higher-Dimensional Implications 

There are various implications of Theorem |1] to higher dimensions. It is not hard to generalize the 
construction and the proof of Lemma|22]to the case of L p norms, as long as p {1, oo}: Instead of 
using a unit sphere for the embedding, use an L p ball of dimension 3, which has a smooth surface 
whenever p ^ 1, oo. The error bounds can be worked out in an analogous way. 
Combined with Theorem ^ we note: 

Corollary 23 Provided that P^NP, the Maximum TSP under an L p norm in M. d with d > 3 fixed 
is solvable in polynomial time, if and only if p € {1, oo}. 
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There is a close connection between the Euclidean norm and polyhedral norms when the number 
of facets k is not fixed, as was pointed out by Joe Mitchell [22]: Since we only need to consider 
O (n 2 ) directions for connections between points, we can replace the Euclidean distances L2 by a 
polyhedral norm with O (n 2 ) facets. 

Corollary 24 The Maximum TSP under a polyhedral norm having a unit ball with k facets in M. d 
is an NP-hard problem, if d > 3 and k is part of the input. 

Another easy consequence concerns the Maximum Scatter TSP, which was first considered by 
Arkin, Chiang, Mitchell, Skiena, and Yang [2]. In this problem, the objective is to find a tour 
that maximizes the length of the shortest edge. Arkin et al. gave an NP-hardness proof for the 
general case and a 2-approximation that uses only triangle inequality. The complexity for geometric 
instances was left as an open problem. Using the above construction and Lemma 122] we get: 

Corollary 25 The Maximum Scatter TSP under Euclidean distances in M. d is an NP-hard problem 
ifd>3. 

Finally, it is straightforward with the above construction to show the following: 

Corollary 26 The Maximum TSP and the Maximum Scatter TSP under shortest distances on the 
(d — 1)- dimensional surface of the d-dimensional unit sphere S^ 1 are NP-hard for d > 3. 

It was also noted by Joe Mitchell that another corollary can be derived by using an approxi- 
mation with O (n 2 ) facets: 

Corollary 27 The Maximum TSP and the Maximum Scatter TSP on the (d — 1)- dimensional 
surface of a d-dimensional convex poly tope with an unbounded number of facets under geodesic 
distances are NP-hard for d > 3. 

Another set of questions concerns the complexity of the Maximum TSP when d is not fixed. It 
is relatively easy to show that the problem is NP-hard for L p norms: 

Theorem 28 The Maximum TSP and the Maximum Scatter TSP are NP-hard for points in d- 
dimensional space under L p norms, 1 < p < 00, when d is part of the input. 

Proof: We use a transformation from the Hamiltonian Circuit problem for simple cubic graphs 
G = (V,E), which was shown to be NP-complete by Garey, Johnson, and Tarjan [I^]. We use a 
separate dimension for each edge e E E, and a point pi for each vertex E V: For edge e = (uj, Vj), 
choose the rc e -coordinate of one of the points (say, pi) to be 1, and the x e -coordinate of the other 
point (say, pj) to be —1. All other x e -coordinates are chosen to be 0. This means that the L p - 
distance between points representing adjacent vertices in G is the "long" value L max := $/2P + 4 
(2 for -Lqo), while nonadjacent vertices get the "short" distance L mm := ^6 (1 for Loo). For fixed 
p > 1, it is straightforward to compute a critical value L < L max in polynomial time such that 
there is a tour of length nL or greater if and only if there is a Hamiltonian cycle in G. □ 

This leaves open the case of the L\ metric when d is not fixed, although we conjecture that 
the L\ case is NP-complete as well. Also open is the question of whether there might be a PTAS 
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for any such norm when d is not fixed. Trevisan |32| has shown that the Minimum TSP is Max- 
SNP-hard for any such norm, and so cannot have such PTAS's unless P = NP. We can obtain 
a similar result for the Maximum TSP under Lqo by modifying our NP-hardness transformation 
to prove Max-SNP-hardness and hence by [I] the existence of an e such that no polynomial time 
approximation algorithm can guarantee a solution within a factor of 1 + e of optimal unless P = 
NP. 

Theorem 29 The Maximum TSP and the Maximum Scatter TSP are Max-SNP-hard for points 
in d-dimensional space under Loo-norms when d is part of the input. 

Proof: The source problem is the Minimum TSP with all edge lengths in {1,2}, a special case 
that was proved Max-SNP-hard by Papadimitriou and Yannakakis We use the construction 
of our previous proof, with edges of length 1 as "edges," edges of length 2 as "non-edges," and each 
"edge" having its own coordinate in \E\ -dimensional space. For this coordinate, one endpoint gets 
value +1, the other gets —1, and all other points get value 0. Thus, adjacent vertices in G get 
mapped to points at the "long" Loo-distance 2, while each pair of vertices at distance 2 in G gets 
mapped to a pair of points at "short" Loo-distance 1. Therefore, long tours in the constructed point 
set correspond to short tours in the original graph. Now the Max-SNP-hardness is immediate, as 
a Maximum TSP tour of length at least (2 — e)n, i.e., with at most en short edges, corresponds to 
a Minimum TSP tour of length at most (1 + e)n, i.e., with at most en edges of length 2. □ 

The question remains open for L p , 1 < p < oo, although we conjecture that these cases are 
Max-SNP-hard as well. 

6 Conclusion 

We have derived polynomial time algorithms for the Maximum TSP when the cities are points in M. d 
for some fixed d and when the distances are measured according to some polyhedral norm or quasi- 
norm, with running time 0(n k ~ 2 log n) for norms based on fc-facet polyhedra and 0(n 2k ~ 2 logn) 
for quasi-norms based on k-f&cet polyhedra. Our approach is based on a solution method for the 
Tunneling TSP; we believe that the related Minimum TSP variant with city connections via a fixed 
set of hubs is of independent interest. We also gave an optimal 0(n) algorithm for the special case 
of 4-facet polyhedra in the plane, such as the rectilinear norm. We suspect it may be possible to 
improve on our complexity for L\ distances in IR 3 by using some of our geometric ideas. (Since 
the unit ball for L\ distances in R 3 is an octahedron, the running time for our general algorithm is 
0(n 6 logn).) 

We have also shown that the Maximum TSP under Euclidean norm in M. d is NP-hard for any 
fixed d > 3. This shows that the complexity of an optimization problem is not just a consequence 
of its combinatorial structure or its geometry, but may be ruled by the structure of the particular 
distance function that is used. The result has similar implications for closely related problems. 

The Euclidean case d = 2 remains open; in the light of our results, it seems more likely that 
this problem is NP-hard, even though its counterpart with rectilinear distances turned out to 
be extremely simple. However, it is much harder to use strictly local arguments for geometric 
maximization problems, so a proof of NP-hardness may have to use a more involved construction. 

Conjecture 30 The Maximum TSP for Euclidean distances in the plane is an NP-hard problem. 
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